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Members of the virulence-associated protein (Vap) family from the pathogen 
Rhodococcus equi regulate virulence in an unknown manner. They do not share 
recognizable sequence homology with any protein of known structure. VapB 
and VapA are normally associated with isolates from pigs and horses, 
respectively. To contribute to a molecular understanding of Vap function, the 
crystal structure of a protease-resistant VapB fragment was determined at 1.4 A 
resolution. The structure was solved by SAD phasing employing the anomalous 
signal of one endogenous S atom and two bound Co ions with low occupancy. 
VapB is an eight-stranded antiparallel /3-barrel with a single helix. Structural 
similarity to avidins suggests a potential binding function. Unlike other eight- or 
ten-stranded /3-barrels found in avidins, bacterial outer membrane proteins, 
fatty-acid-binding proteins and lysozyme inhibitors, Vaps do not have a next- 
neighbour arrangement but consist of two Greek-key motifs with strand order 
41238567, suggesting an unusual or even unique topology. 



PDB reference: R. equi VapB, 4cv7 



1 . Introduction 




Rhodococcus equi is a Gram-positive facultative intracellular 
pathogen mainly of young foals, but also of other animal species and 
of human AIDS patients. R. equi is closely related to mycobacteria 
such as Mycobacterium tuberculosis and causes life-threatening 
pulmonary and extra-pulmonary infections (Vazquez-Boland et at, 
2013; von Bargen & Haas, 2009). 

Epidemiological studies have shown that infection of different 
animal species is closely coupled to the possession of certain types of 
virulence-associated plasmids by the infecting bacteria. The plasmid 
type typically found in equine isolates mediates production of the 
immunodominant secreted protein VapA (virulence-associated 
protein A; Sekizaki et ah, 1995). Porcine isolates typically produce the 
allelic variant VapB (Takai et a!., 2000). vap A and vapB have never 
been described together in a single bacterial isolate. VapA production 
is absolutely required for virulence phenotypes in a mouse infection 
model and in murine macrophages (Jain et al., 2003) and it is also a 
prerequisite for the neutralization of phagosome and lysosome pH in 
infected macrophages (von Bargen et al. , 2009). An additional ten vap 
genes and three vap pseudo-genes identified on the vapA- and vapB- 
encoding virulence plasmids seem to have only a minor role (if any) 
in virulence (Jain et al., 2003). The amino-acid sequence similarities 
between all Vap proteins according to ClustalW are typically in the 
30-50% identity range, with the 78% identity score between VapA 
and VapB being particularly high (Letek et al., 2008). 

The precise mechanisms of how Vaps support R. equi virulence is 
unknown, particularly as there is apparently no protein to be found in 
current amino-acid sequence databases that has a meaningful 
sequence identity to VapA and for which the function has been 
annotated. We reasoned that establishing the three-dimensional 
structure of a Vap protein would possibly hint at the putative func- 
tion^) of the Vap family. Here, we present the crystal structure at 
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1.4 A resolution of the 12 kDa C-terminal protease-resistant core of 
recombinantly produced VapB. 

2. Materials and methods 

2.1. Protein expression and purification 

Mature R. equi VapB (residues 36-197 lacking the signal sequence) 
was generated by PCR from the sequenced virulence-associated 
plasmid of strain PAM 1593 (Letek et at, 2008). The PCR primers 
were constructed in such a way that they generated an N-terminally 
His 6 -tagged VapB protein when cloned into the pETite N-His Kan 
vector (BioCat, Heidelberg, Germany) by 'in-fusion reaction', 
according to the manufacturer's manual. The primers were 
VapB_his-6_forward, 5'-CATCATCACCACCATCACGTGCTG- 
GATTCCGGAGGCGGC-3', and VapB_his-6_reverse, 5'- 
GTGGCGGCCGCTCTATTATTATGCAACCTCCCAGTTGTG-3'. 
Protein expression was induced in Escherichia coli BL21 (DE3) cells 
at an OD 60() of 0.76 with 1 mM IPTG for 4 h at 37° C. Protein puri- 
fication was carried out at 4°C. The cell pellet from 2 1 of bacterial 
culture was resuspended in 40 ml lysis buffer (50 mM NaH 2 P0 4 , 
300 mM NaCl, 10 mM imidazole pH 7) plus one EDTA-free 
cOmplete protease-inhibitor tablet (Roche) and 3 ul DNase I 
(40 mg ml -1 ) and lysed by three passes through a French pressure 
cell. The lysate was cleared by centrifugation and the supernatant was 
applied onto 10 ml Ni-NTA Sepharose equilibrated in lysis buffer. 
The resin was washed with 3 x 30 ml lysis buffer (pH 7) containing 
10 mM, 20 mM or 40 mM imidazole. The target protein was eluted 
with lysis buffer including 500 mM imidazole (pH 7). The eluate was 
dialyzed against 2 1 phosphate-buffered saline (PBS) twice. The yield 
was roughly 80 mg protein per litre of bacterial culture. For crystal- 
lization, the protein was digested with proteinase K (Roth, product 
No. 7528.1). 40 mg of VapB were digested with 80 ug proteinase K 
(500:1 proteimprotease mass ratio) in 40 ml PBS for 90 min at 37°C. 
The reaction was stopped with 400 ul PMSF (phenylmethylsulfonyl 
fluoride) (17 mg ml -1 in ethanol; 100 mM) on ice for 20 min and then 
the solution was dialyzed against 1 1 buffer A (25 mM Tris, 20 mM 
NaCl pH 7), plus 1 ml PMSF (100 mM in ethanol) overnight with a 
protein recovery of about 50% (20.6 mg). Proteinase-K-digested 
VapB was further purified by anion-exchange chromatography over 
Source Q (GE Healthcare) resin equilibrated in buffer A. The protein 
was eluted by a salt gradient to 1 M NaCl. SDS-PAGE analysis 
revealed that proteinase K digestion had resulted in two fragments of 
very similar molecular mass, of which the slightly heavier fragment 
eluted earlier. Early peak fractions were pooled, dialyzed against 
buffer A overnight, concentrated to 16.8 mg ml -1 using Vivaspin 
concentrators (Sartorius) and frozen in small aliquots. The recovery 
after proteinase K digestion and purification was about 12.5% (5 mg). 

2.2. Crystallization 

Crystals grew in a sitting-drop vapour-diffusion setup in the initial 
sparse-matrix screen (condition G4 of The JCSG Core Suite II; 
10 mM CoCl 2 , 100 mM sodium acetate pH 4.6, 1.0 M hexanediol) 
with 10 mg ml -1 protein in buffer A at 20°C with a drop consisting of 
1 ul protein solution and 0.5 ul reservoir solution. Crystals were 
observed after 23 weeks (the previous inspection being at 15 weeks), 
presumably well after the crystallization drop had reached equili- 
brium. Single crystals could easily be isolated for mounting from a 
bundle of thick (40-50 um) needles reaching 200-300 iim in length. 
The crystal used for phasing was cryoprotected with 10 mM CoCl 2 , 
100 mM sodium acetate pH 4.6, 1.1 M 1,6-hexanediol, 20% glycerol. 
The crystal used for high-resolution data collection was cryopro- 



Table 1 

Data collection and processing. 



Values in parentheses are 


for the outer shell. 




Data set 


Phasing 


Refinement 


Diffraction source 


Microfocus sealed tube 


ESRF beamline ID29 


Wavelength (A) 


1.5418 


0.91985 


Temperature ( n C) 


— 173 


— 173 


Detector 


Atlas CCD 


PILATUS 6M 


Crystal-to-detector 


52 


286 


distance (mm) 






Rotation range per 


0.35 or 0.5 


0.3 


image (°) 






Total rotation range 


1379 


360 


O 

Space group 


P6i22 


P6i22 


a, b, c (A) 


83.55, 83.55, 49.28 


83.82, 83.82, 49.13 


IVIosaicity (° J 


0.15 


0.10 


Resolution range 


15-1.9 (1.95-1.90) 


50-1.4 (1.44-1.40) 








Total No. reflections 


1113900 (26028) 


774043 (43740) 


No. unique reflections 


15146 (llll)t 


20584 (1499)t 


Completeness (%) 


99.7 (100) 


100 (100) 


Multiplicity 


73.5 (23)f 


37 (29)t 


(7/<t(/)) 


49.7 (5.6) 


26.9 (2.7) 


«» ra ,§ (%) 


13.8 (71.6) 


9.6 (126.4) 


CC 1/2 


100.0 (92.7) 


100.0 (85.2) 


Overall B factor from 


20 


22 


Wilson plot (A 2 ) 






t Friedel mates separate. 


t Friedel mates merged. § 


^meas as defined in Diederichs 


& Karplus (1997): R, 


.»*, = Y ll ,u{N(hkl)/[N(hk!) 


-l]) ll2 Y.Mhkl)-Whkl))\l 



tected with 10 mM CoCl 2 , 100 mM sodium acetate pH 4.6, 2.3 M 1,6- 
hexanediol. Both crystals were flash-cooled in liquid nitrogen. 

2.3. Data collection and processing 

Data sets were collected from two different crystals from the same 
crystallization drop. High-redundancy data for SAD (single-wave- 
length anomalous dispersion) phasing were collected on a Super 
Nova diffractometer (Agilent) equipped with a 40 W copper sealed- 
tube microsource and an Atlas CCD detector. High-resolution data 
for refinement were collected on ESRF beamline ID29 at a wave- 
length of 0.91985 A on a PILATUS 6M detector. All data were 
indexed and integrated with XDS and scaled with XSCALE (Kabsch, 
2010). Data-collection and processing statistics are given in Table 1. 

2.4. Structure solution and refinement 

2.4.1. Substructure solution. The substructure was solved with 
SHELXD (Sheldrick, 2010) run through the graphical user interface 
HKL2MAP (Pape & Schneider, 2004) for several thousand trials with 
different high-resolution cutoffs, and between two and four requested 
sites. Equivalence of the resulting substructures was checked with 
phenix.emma (Adams et at, 2010). Solutions with CC a u and CC weak of 
around 50 and 30, respectively, yielded three consistent sites. 

2.4.2. Density modification and polyalanine model. Density 
modification was carried out with the /i-test version of SHELXE 
including autotracing. SHELXE (Sheldrick, 2010) was run with 20 
cycles of density modification, a solvent fraction of 0.45 and a high- 
resolution cutoff for the native data of 1.9 A (the synchrotron data 
only became available later) for 200 cycles of autotracing. The best 
autotracing cycle resulted in 84 out of 113 residues in six chains. The 
resulting structure had a significant amount of secondary structure 
and revealed a /3-barrel with a single helix. 

2.4.3. Model completion and refinement and structure analysis. 
The resulting polyalanine structure and phases were fed into 
phenix.autobuild (Adams et at, 2010; Terwilliger et at, 2008) along 
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with the amino-acid sequence for automatic model completion using model was completed manually in Coot (Emsley et ai, 2010). For the 
default settings. The resulting model had 96 out of 113 residues (84%) high-resolution synchrotron data, free-i? flags were copied from the 
built in sequence and R work of 23.51% and R [ret , of 26.02%. This data set used for phasing and extended. The model was refined with 




Figure 1 

(a) Cartoon representation of VapB coloured from dark blue to red from the N-terminus to the C-terminus. The two views are rotated 180^ around a vertical axis, (b) 
Electrostatic potential of VapB oriented as in («) and coloured on a scale from —15 (red) to +15 (blue) kTle. (c) Distribution of hydrophilic (left) and hydrophobic (right) 
residues in VapB oriented as in the left of (a). 
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Table 2 

Structure solution and refinement. 



Values in parentheses are for the outer shell. 



Resolution range (A) 


50-1.4 (1.436-1.400) 


Completeness (%) 


99.95 (99.87) 


No. of reflections 




Working set 


18526 (1345) 


Test set 


2030 (147) 


Final R CTyst (%) 


17.0 (22.7) 


Final R, m (%) 


19.9 (26.9) 


No. of non-H atoms 




Protein 


915 


Ion 


2 


Water 


107 


Total 


1024 


R.m.s. deviations 




Bonds (A) 


0.024 


Angles (°) 


2.234 


Average B factors (A 2 ) 




Protein 


20.4 


Ion 


22.5 


Water 


29.2 


Ramachandran plot 




Most favoured (%) 


96.8 


Allowed (%) 


2.4 


Outliers (%) 


(1.8 



REFMAC5 (Murshudov et aL, 2011) using TLS refinement with 12 
TLS groups that had been determined with the TLSMD server 
(Painter & Merritt, 2006). Refinement statistics are given in Table 2. 
The oligomeric state was determined with PISA (Krissinel & 
Henrick, 2007). Structural similarities were identified with the DALI 
server (Holm & Rosenstrom, 2010) and PDBeFold (Krissinel & 
Henrick, 2004). PDBeFold was run with default settings except 
setting 'lowest acceptable match' to 60% for both query and target. 
Figures were prepared with PyMOL (DeLano, 2002). Sequences 
were aligned with MAFFT (Katoh & Standley, 2013). Alignment 
figures were prepared with ESPript (Gouet et aL, 2003). 



3. Results and discussion 

3.1. Structure determination of the highly conserved protease- 
resistant Vap core 

We first tried to crystallize VapA, as its role in virulence has been 
most clearly documented. Full-length VapA was degraded into a 
stable fragment, presumably by trace amounts of proteases remaining 
after purification. In an attempt to completely degrade purified VapA 
to serve as negative control in cellular assays, we had fortuitously 
found that VapA has a proteinase-K-resistant core consisting of the 
highly conserved C-terminus (Supplementary Fig. SI 1 ). We purified 
this proteinase-K-resistant fragment for crystallization, but did not 
obtain any diffracting crystals. Therefore, we turned to VapB, the Vap 
most similar to VapA. Again, full-length VapB was unstable and did 
not crystallize. We used proteinase K digestion to remove the N- 
terminus. Edman sequencing (Proteome Factory, Berlin) revealed 
that the resulting 12 kDa stable fragment started at amino acid 
Glu85. We will refer to this fragment as VapB throughout this paper. 
Within the proteinase-K-resistant domains, the VapA and VapB 
amino-acid sequences are 88% identical plus 6% highly conserved. 
VapB crystallized in a condition containing 10 mM CoCl 2 from a 
commercial screen. We collected a high-redundancy data set on a 
copper sealed-tube diffractometer (Table 1). The anomalous signal 
originating from the S atom of the single methionine and two low- 

1 Supporting information has been deposited in the IUCr electronic archive 
(Reference: HV5258). 



occupancy (~25%) Co + ions was sufficient to solve the phase 
problem by SAD. Clear electron density is visible for residues Glu87- 
Vall96 with some density for residues Gln86 and Alal97. Apparently, 
there was no or incomplete cleavage of the five C-terminal VapB 
residues although they are accessible and in a loop conformation. 

3.2. Structure of VapB 

VapB forms a monomeric eight-stranded antiparallel /i-barrel with 
a single helix (Fig. la). We refer to the molecule as oriented in Fig. 
1(a) with both termini and the helix at the bottom and the turn 
connecting strands 1 and 2 and the loops between strands 5 and 6 or 7 
and 8 located at the top. The distribution of hydrophobic and polar 
residues on the VapB surface is not uniform. The bottom is enriched 
in polar side chains, including eight negatively and two of the three 
positively charged residues of VapB, while the top is more hydro- 
phobic (Figs, lb and lc). Conserved surface-exposed residues are 
presumably relevant for the fold and often form hydrogen bonds via 
their side chains, e.g. Tyrl02, Tyrl43 or Asnl59 and Asnl61 
(Supplementary Fig. SI). Vaps contain many glycine residues that are 
very well defined in the electron density. They often locate to /J- 
strands and are conserved for steric reasons, assuming backbone 
conformations that are not accessible to nonglycine residues, or 
because the addition of only a C 5 atom would result in clashes 
(Supplementary Fig. SI). It is questionable whether VapF, another 
member of the Vap family, can form a stable protein with the same 
fold, as it lacks highly conserved sequences forming strands 7 and 8 
(Supplementary Fig. S2). 

A search for similar structures with DALI returns mostly other 
eight-stranded and some ten-stranded /S-barrels but no highly 
significant hits (Z-scores < 6; sequence identity < 15%). Similar 
structures include bacterial outer membrane proteins (Schulz, 2002), 
proteins from the avidin/streptavidin superfamily (Livnah et ai, 1993; 
Weber et ai, 1989) and cytoplasmic fatty-acid-binding proteins 
(Zimmerman & Veerkamp, 2002). PDBeFold identified bacterial 
lysozyme inhibitors (Leysen et aL, 2011; Yum et aL, 2009) as top hits. 

VapB has a tightly packed core sealed at the bottom by the helix 
(Fig. 2a). There is neither a central pore typical of outer membrane 
porins nor is there a central cavity typical for fatty-acid-binding 
proteins. Functional similarity might be highest to avidins. Avidins 
bind biotin in a groove at the top of the molecule, when oriented like 
VapB (Fig. 2b ; Livnah et aL, 1993; Weber et ai, 1989). The loops at 
the top of VapB have higher B factors, indicating flexibility (Fig. 2c). 
The connections between strands 1 and 2 and strands 7 and 8 (resi- 
dues 99-101 and 181-184, respectively) display double conforma- 
tions. It is conceivable that these turns/loops move to open a mainly 
hydrophobic binding site formed by the top of the VapB hydrophobic 
core. A function of VapB in binding some kind of ligand thus seems 
possible. 

3.3. VapB consists of two Greek-key motifs with unprecedented 
topology 

While VapB is structurally related to bacterial outer membrane 
proteins, avidins, fatty-acid-binding proteins and lysozyme inhibitors, 
with root-mean-square deviations of around 3 A for some 70 aligned 
C" atoms, the topology of VapB is quite unusual. Outer membrane 
proteins, avidins, fatty-acid-binding proteins and lysozyme inhibitors 
all are antiparallel /3-barrels in which each strand is directly linked to 
its neighbour (Fig. 3a). In contrast, the VapB /3-barrel consists of two 
Greek-key motifs (Hutchinson & Thornton, 1993) with strands 
arranged in the order 41238567 and a helix connecting strands 4 and 5 
(Fig. 3b). Greek-key motifs are common in five- or six-stranded fi- 
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(a) (b) (c) 

Figure 2 

(a) View from the top down the axis of the VapB ^-barrel. Side chains of residues forming the hydrophobic core are represented as sticks, (b) Cartoon representation of 
streptavidin (PDB entry lstp; Weber etal, 1989) coloured from dark blue to red from the N-terminus to the C-terminus. Biotin is shown as spheres, (c) VapB coloured by B 
factor (blue, low; red, high) and oriented as streptavidin with respect to strand {31 (note that the structural alignment generated by DALI is different). The loops at the top 
exhibit high B factors and double backbone conformations for residues that are represented as sticks. 



Hiram 



(a) 



1 rx i a 



s 



(c) 



Figure 3 

Topology diagrams of eight-stranded antiparallel /^-barrels (a) with next-neighbour connections as found in outer membrane proteins, avidins and lysozyme inhibitors; (b) 
two Greek-key motifs as present in VapB; (c) two Greek-key motifs as described for RPB8 (adapted from Krapp et al, ' 
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barrels and in /3-sandwich structures (Zhang & Kim, 2000), for 
example the fi- and y-crystallins with two Greek-key motifs that form 
two four-stranded sheets (Jaenicke & Slingsby, 2001). A compre- 
hensive review of Greek-key motifs in /J-barrels stated that there is 
only one eight-stranded /3-barrel known that contains Greek keys 
(Zhang & Kim, 2000). This protein (RNA polymerase subunit RPB8; 
PDB entry laid; Krapp et al., 1998) has strand order 32148567 (Fig. 
3c) and is structurally unrelated to VapB. While we cannot exclude 
the fact that an eight-stranded /i-barrel with the strand order of VapB 
has been described before, such a topology is not found among the 
top hits of DALI or PDBeFold. 

In conclusion, we have determined the structure of VapB as 
representative of the R. equi Vap family. The Vaps fold into two 
Greek-key motifs forming an eight-stranded /3-barrel with an 
unusual, if not unprecedented, topology. The arrangement of strands 
is different from /i-barrels with next-neighbour connections present 
in outer membrane proteins, avidins, fatty-acid-binding proteins and 
lysozyme inhibitors and different from crystallins that also consist of 
two Greek-key motifs. Analysis of the structure and surface proper- 
ties suggests that Vaps might act as binding proteins involved in 
interaction with as yet unidentified ligands. 

We gratefully acknowledge access to beamline ID29 of the ESRF 
(Grenoble, France) and coverage of travel expenses through BAG 
MX-1421. We thank Roman Fedorov (Hannover Medical School) for 
data collection at ESRF ID29 and Stephan Barden for critical reading 
of the manuscript and help with figures. This study was supported by 
the Deutsche Forschungsgemeinschaft (grant SFB 670/TP3 to AH). 
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